# Efficient Pre-training
**Open-Qwen2VL** · weizhiwang · CC license · Image-to-Text · English · 568 downloads · 15 likes
Open-Qwen2VL is a multimodal model that accepts both images and text as input and generates text output.

**Llama3 German 8B 32k** · DiscoResearch · Large Language Model · Transformers · German · 91 downloads · 13 likes
A German large language model based on Meta Llama3-8B, continuously pre-trained on 65 billion German tokens and supporting a 32k-token long context.

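A minimal usage sketch with the Hugging Face `transformers` library; the Hub ID `DiscoResearch/Llama3-German-8B-32k` is inferred from the entry name, not confirmed:

```python
# Minimal sketch: German text generation; the Hub ID below is assumed.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="DiscoResearch/Llama3-German-8B-32k",  # assumed repository name
    device_map="auto",
    torch_dtype="auto",
)
prompt = "Die wichtigsten Vorteile erneuerbarer Energien sind"
result = generator(prompt, max_new_tokens=64, do_sample=True, temperature=0.7)
print(result[0]["generated_text"])
```
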
**TinyLlama v1.1** · TinyLlama · Apache-2.0 license · Large Language Model · Transformers · English · 42.11k downloads · 92 likes
TinyLlama is a small language model with 1.1 billion parameters that adopts the same architecture and tokenizer as Llama 2, making it suitable for resource-constrained applications.

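Because TinyLlama reuses the Llama 2 architecture and tokenizer, it loads through the standard causal-LM classes; a minimal sketch, assuming the Hub ID `TinyLlama/TinyLlama_v1.1`:

```python
# Minimal sketch: loading a 1.1B causal LM; the Hub ID below is assumed.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TinyLlama/TinyLlama_v1.1"  # assumed repository name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("Small language models are useful because", return_tensors="pt")
with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=48)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```
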
**VideoMAE Base** · MCG-NJU · Video Processing · Transformers · 48.66k downloads · 45 likes
VideoMAE is a self-supervised video pre-training model based on the Masked Autoencoder (MAE), which learns internal video representations by predicting the pixel values of masked video patches.

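A minimal sketch of extracting video representations with the `transformers` VideoMAE classes, assuming the Hub ID `MCG-NJU/videomae-base`; the 16 random frames stand in for a real video clip:

```python
# Minimal sketch: encoding a 16-frame clip with VideoMAE; the Hub ID below is assumed.
import numpy as np
import torch
from transformers import VideoMAEImageProcessor, VideoMAEModel

model_id = "MCG-NJU/videomae-base"  # assumed repository name
processor = VideoMAEImageProcessor.from_pretrained(model_id)
model = VideoMAEModel.from_pretrained(model_id)

# Dummy RGB frames; replace with frames sampled from a real video.
video = [np.random.randint(0, 256, (224, 224, 3), dtype=np.uint8) for _ in range(16)]
inputs = processor(video, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (batch, num_patches, hidden_size)
```
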
**Chinese ELECTRA Large Generator** · hfl · Apache-2.0 license · Large Language Model · Transformers · Chinese · 14 downloads · 0 likes
Chinese ELECTRA is a pre-trained model developed by the HIT-iFLYTEK Joint Lab (HFL) based on Google's ELECTRA, delivering strong performance with a comparatively small parameter count.

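The generator half of ELECTRA is trained as a masked language model, so it can be exercised with a fill-mask pipeline; a minimal sketch, assuming the Hub ID `hfl/chinese-electra-large-generator`:

```python
# Minimal sketch: masked-token prediction with the ELECTRA generator; the Hub ID is assumed.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="hfl/chinese-electra-large-generator")  # assumed repository name
for candidate in fill_mask("哈尔滨是黑龙江省的[MASK]会。"):
    print(candidate["token_str"], round(candidate["score"], 3))
```
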
**DistilCamemBERT Base** · cmarkea · MIT license · Large Language Model · Transformers · French · 15.79k downloads · 31 likes
DistilCamemBERT is a distilled version of the French CamemBERT model, using knowledge distillation to significantly reduce model complexity while maintaining performance.

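A minimal fill-mask sketch; the Hub ID `cmarkea/distilcamembert-base` is assumed, and the `<mask>` token follows the CamemBERT tokenizer convention:

```python
# Minimal sketch: French masked-word prediction; the Hub ID below is assumed.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="cmarkea/distilcamembert-base")  # assumed repository name
for candidate in fill_mask("Paris est la <mask> de la France."):
    print(candidate["token_str"], round(candidate["score"], 3))
```
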
**Chinese MobileBERT** · Ayou · Apache-2.0 license · Large Language Model · Transformers · 25 downloads · 5 likes
This model applies the MobileBERT architecture to a 250-million-word Chinese corpus, pre-trained for 1 million steps over 15 days on a single A100 GPU.

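A minimal encoding sketch loaded through the generic Auto classes; the repository name below is a placeholder for illustration only (check the author's page for the exact Hub ID):

```python
# Minimal sketch: encoding Chinese text with a MobileBERT checkpoint.
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "Ayou/chinese-mobile-bert"  # hypothetical Hub ID, used only for illustration
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

inputs = tokenizer("移动端模型需要兼顾速度和精度。", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (batch, seq_len, hidden_size)
```
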